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Abstract — Although a great deal of research has 
been carried out over the last decade in which many 
different approaches for devising optoelectronic in- 
terconnects have been proposed, more detailed analy- 
sis of the performance parameters such as power, la- 
tency, and area of different optical technologies is yet 
to be done. Optoelectronics offer potential for new 
approaches to the production of VLSI interconnects, 
but previous works have made overly simplified as- 
sumptions for optoelectronic interconnections. In 
this paper, we present a preliminary analysis of the 
system parameters of different optoelectronic inter- 
connect technologies. Based on our analysis, we con- 
clude that the three-dimensional free-space optoelec- 
tronic interconnect network has the best speed area 
product performance compared with fiber-optical in- 
terconnects and optical microelectromechanical sys- 
tems (MEMS) interconnects. Second, although the op- 
toelectronic interconnect offers higher data rates and 
less power per bit compared with electronic intercon- 
nects, the bipolar encoding scheme on the source 
plane and the detector plane means that a larger area 
and volume will be needed. 

Index Terms — 3-D free-space holographic intercon- 
nection network; Optical MEMS interconnect; Truth 
table; VCSEL; Modulators; Detectors; Optical bus. 



I. Introduction 

Optical interconnection research aims to replace 
the electrical interconnection of electronic de- 
vices with optical waves. From physics, we know that 
electrical interconnection and optical interconnection 
[1] take advantage of two different types of physical 
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phenomena. Electrical interconnection [2,3] necessi- 
tates downscaling the feature size of the transistor. 
Although this may increase the computation speed, 
the tighter packaging may also increase the parasitic 
capacitance and signal propagation delay. In addition, 
since the electrical interconnection is treated as a uni- 
form RLC transmission line, the coupling capacitance 
and on-chip self-inductance may lead to additional de- 
lay and cross-talk. This means that electronic inter- 
connects are prone to encountering the bottleneck [4] 
due to computational bandwidth limitations. Also, 
when the computational bandwidth increases, connec- 
tion impedance mismatch may become a critical fac- 
tor. 

Li and Popelek [5,6] performed a volume consump- 
tion comparison of free-space and guided-wave optical 
interconnection. They showed that the single micro 
aperture-per-channel implementations of either 
space-invariant or space-variant operations are more 
volume efficient than their two-cascade counterparts 
and that free-space optics is less volume efficient than 
guided-wave fiber optics. However, their free-space op- 
tical interconnection primarily focuses on point-to- 
point interconnection. If new methods can be used 
successfully to convert the combinational circuits used 
in the electronic interconnect, such that each type of 
combinational circuit can be replaced with holo- 
graphic interconnections and minor electronic cir- 
cuits, the results of the above comparison might be 
very different. 

In addition, many researchers have proposed holo- 
graphic interconnections for free-space optical inter- 
connection [7-12]. With holographic interconnections 
the focus is on interconnection capacity, diffraction ef- 
ficiency, data rates, and power consumption. Although 
holographic optical interconnection is indeed three di- 
mensional in nature [12], the Fresnel lens or holo- 
grams used to interconnect the optical source and de- 
tector array are inherently limited by their 
geometries. Moreover, the Fresnel lens is less energy 
and volume efficient than fiber optics or waveguide in- 
terconnection [5]. 
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Chen et al. [13] demonstrated the functioning of the 
high-speed optical data bus, which is equivalent to the 
VME bus and FutureBus, using a polyimide wave- 
guide as the interlayer dielectric material to form 
three-dimensional optical clock distribution. Their ex- 
cellent work primarily focuses on the clock distribu- 
tion similar to the VLSI system. 

In 1987, Lee [14] proposed a truth-table lookup ho- 
lographic processor. Lee was the first to suggest the 
idea of using dual optical sources and dual detector ar- 
rays in the holographic optoelectronic interconnect 
network. Lee was also the first to suggest using the 
detector's summation circuit, the XOR circuit, and 
comparison circuit in between sources and detectors. 
Lee's method can convert any type of digital input to 
generate a digital output signal. 

Ozaktas and co-workers [15-19] applied the results 
of area-volume complexity theory for solid wires to op- 
tical communication systems. They concluded that the 
optically interconnected electronic circuit represents 
the best alternative given the physical limitations. 

In 1992, Krishnamoorthy et al. [20] discussed a ho- 
lographic imaging optoelectronic interconnection that 
was a slightly modified version of the traditional 4f 
optoelectronic system. The scaling factor in the optical 
lens can be designed so that the detector arrays are 
centered on the source arrays. The MIN and MAX cir- 
cuits [20] are identical to the comparison circuit in 
Lee's design [14]. 

Guilfoyle et al. [21] proposed a truth-table decompo- 
sition calculation method. Drabik [22] published a 
similar result related to "3-D optoelectronic free-space 
interconnects." 

Cho et al. [23] suggest that optical interconnects are 
superior to electronic interconnects because of their 
lower attenuation and lower noise. Ozaktas et al. [24] 
conclude that the multifacet free-space architecture 
can be used to achieve the smallest possible two- 
dimensional optoelectronic interconnect. 

In Section II, we give an overview of optoelectronic 
interconnects mentioning previous works including 
the fiber-optical interconnect, polymer waveguide op- 
tical interconnect, Fresnel hologram optical intercon- 
nect, 3-D free-space optoelectronic interconnect net- 
work, and optical microelectromechanical systems 
(MEMS) interconnect. 

In Section III, we perform mathematical analysis of 
the optoelectronic switching interconnect network 
based on a digital truth table. In this section, we will 
also discuss the algorithm for converting the truth 
table into a format that is suitable for optoelectronic 
holographic implementation. In addition, we discuss 
recent progress on microscale holographic systems as 
well as deriving the power, area, and latency equa- 
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tions for the 3-D free-space optoelectronic intercon- 
nect network system. Simulation results will be pre- 
sented here. 

In Section IV, we discuss recent progress using op- 
tical MEMS devices in optoelectronic interconnects. 
We derive system-related equations for optical MEMS 
interconnects. 

In Sections V and VI, we perform parametric pa- 
rameter comparisons of the area, latency, and power 
consumption among different optoelectronic intercon- 
nect technologies. We also discuss an example of the 
calculation using VCSEL and p-i-n photodiodes. Fi- 
nally, a discussion and conclusions are offered in Sec- 
tion VII. 



II. Overview of Optoelectronic Interconnects 

The physical properties of photons [25,26] are fun- 
damentally different from those of electrons. Photonic 
interconnects must have source-to-detector and 
detector-to-source conversion. To construct large-scale 
digital devices in optics, one needs to implement the 
appropriate level of read-only memory (ROM) or pro- 
grammable logic array (PLA). Any attempts to solve 
this problem at the transistor level or gate level may 
fail. This important process has been overly simplified 
in many previous works. 

The performance of a multiprocessor system [27] 
mainly depends on the bus architecture. The bus ar- 
chitecture is highly related to its bus interconnections. 
Figure 1 shows a system block diagram of a typical 
multiprocessor system. Since the bus performance is 
so dependent on the shorter propagation delay and 
larger bus width, the optoelectronic interconnection 
can play an important role in system-on-chip multi- 
processor design. Figure 2 demonstrates the bus arbi- 
ter circuit [28] in a multiprocessor system. 

The system's interconnection can be through a fiber- 
optical interconnect, Fresnel hologram interconnect, 
3-D optoelectronic switching network, or 3-D optoelec- 
tronic MEMS interconnect. 

If the above circuits are connected using a fiber- 
optical interconnect or Fresnel hologram interconnect, 
then the programmable read-only memory (PROM) 
circuit and bus switching interface inside the bus ar- 
biter circuit must still be implemented with VLSI. 
One end of the bus switching interface is connected to 
the optical source, and the other end of the global bus 
is connected to the optical detector. 

However, if the above circuits are connected using a 
3-D optoelectronic switching network and 3-D opto- 
electronic MEMS interconnects, then the bus arbiter 
circuit and bus switching circuit can be replaced with 
a computer-generated hologram. As a consequence, 
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Fig. 1. (Color online) Multiprocessor block diagram. 

not only does the switching speed of the system in- 
crease, but there also is a larger signal bandwidth and 
reduction in the total consumed power. 



iii. three-dimensional free-space optoelectronic 
Switching Network 

A. Analysis 

Ozaktas and Miller [29] suggested the fundamental 
optical pipeline principle. Figure 3 shows an outline of 
the optoelectronic pipeline architecture. For a syn- 
chronous digital system, the following relationship 
must hold: 



■ hold ' 



■ comb ' 



T-T t 



setup ' 



(l) 



Thus the time delay T comh is constrained by the sys- 
tem clock period T and hold time T hM [2] . If we want 
a shorter period or faster speed for the system clock, 
then we must have a smaller value of T comh . There- 
fore, if the above combinational logic is implemented 
in the digital optical system, the value of T comb is ap- 
proximately equal to 




Input 
Signals 
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Fig. 2. (Color online) Bus arbiter circuit. 
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where N is the number of stages of conversion be- 
tween the electronic and optical signal, t d is the re- 
sponse time of the detector array, and t s is the re- 
sponse time of the optical source array. Ideally, we 
would like to see N become as small as possible. We 
also need to have the smallest values of t d and t s . 

During the 1970s, computer scientists S. L. Hurst 
and C. L. Edward studied the use of digital synthesis 
methods to implement Boolean logic for VLSI designs 
[30-34] . Their method was later found to work only in 
certain Boolean functions, and it cannot cover the 
complete Boolean spectrum. Y.-P. T. Lee [14] and C. C. 
Lee [35] continued to work on this problem during the 
1980s and 1990s. The results are described below. 

Definition: Binary Truth Table for Boolean Logic. Bool- 
ean logic can be represented by a truth table, a Kar- 
naugh map or a canonical form. A binary truth table 
with two stable states {0,1} contains m input Boolean 
variables and n output Boolean functions, and there 
exist 2 m combinations on a single output Boolean 
function. We now have a total of n X 2 m possible out- 
puts. Mathematically, this can be expressed in the fol- 
lowing equation: 

Jk = /k( x l> x 2> x 3> • • • k = 1,2,3, ... ,n. 

Definition: Equivalence of {0,1} and {+1,-1}: The bi- 
nary {0,1} exclusive or operation is equivalent to the 
multiplication of {+1,-1}, as shown below: 



1 0 



X 



-1 +1 



0 1 

1 0 



-1 +1 -1 
+ 1 -1 +1 



Theorem: Boolean Logic Synthesis in the Optical Do- 
main: A binary truth table f(x) with two stable states 
{0,1} or bipolar encoding f(z) with two stable states 
{+1,-1} contains m input and n output functions. 
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Fig. 3. Optoelectronic pipeline architecture and logic timing 
constraints. 

There exist Boolean basis vectors and Boolean spec- 
tral expansion vectors. The inner product of these two 
vectors is equivalent to the original truth table [14]. 
The Boolean spectral expansion vector is calculated 
from the Walsh transform; the Boolean basis vector is 
generated from the sequence in the corresponding 
Walsh transform. The mathematical relationship is 
shown below: 



2 m -l 



[s p (x l )]=—[T m m 



2 m -l 

[Z]= 2 lr p (zi)]®[R p (zi)l 



[R p (z i )] = —[T m ][Z], 



Z{ — 1 — 2xi, 



(4) 



(5) 



(6) 



(7) 



(8) 



where [T 71 ] is the mth-order discrete Walsh transform 
and 0 stands for the inner product. The Walsh trans- 
form can be a Walsh-Hadamard transform, 
Rademacher-Walsh transform, or Walsh-Kaczmarz 
transform. In Eqs. (4)-(8), x t are the truth-table Bool- 
ean variables obtained using {0,1} coding, z t are the 
truth-table Boolean variables obtained using {+1,-1} 
coding, X p (xi) represents the Boolean basis vector ob- 
tained using {0,1} coding, r p (zj) represents the Boolean 
basis vector obtained using {+1,-1} coding, S p {xj) rep- 
resents the spectral expansion vector obtained using 
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Fig. 5. Binary truth table for a two-to-one multiplexer diagram. 



{0,1} coding, and R p {zi) represents the spectral expan- 
sion vector obtained using {+1,-1} coding. 

Three Variable Example: The multiplexer, decoder, 
and arithmetic circuits are important combinatorial 
circuits in a digital design. The multiplexer selects 
one of several input signals to send to a signal output. 
Figures 4 and 5 show a diagram of a multiplexer and 
its binary truth table. This multiplexer can be imple- 
mented with optoelectronics as follows: 

A demultiplexer performs reversed operation of a 
multiplexer, taking a single input to send to one data 
output. The multiplexer and decoder are extremely 
important in any type of bus connection circuits. The 
three input variable truth tables shown below are cho- 
sen as an example of a multiplexer, two-to-four decod- 
ers, and a one-to-four demultiplexer circuit in a free- 
space optoelectronics network. The first column of the 
Z matrix represents a two-to-one multiplexer, the sec- 
ond to the fifth columns represent a two-to-four de- 
coder, and the unused rows are filled with 0 ? s. The 
sixth to the eighth columns represent a one-to-four de- 
multiplexer. The calculation of the spectral expansion 
matrix is shown below: 



Rm 



Rni [v T 




Fig. 4. Two-to-one multiplexer diagram. 



Fig. 6. Detector arrays and comparator. 
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(9) 



where R is the spectral expansion matrix for optoelec- 
tronic connection circuits, T is the Walsh-Hadamard 
transform, and Z is the input Boolean basis vector. 
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Figure 6 shows a diagram of detector arrays, a com- 
parator circuit for single-column truth-table output. 
The inputs of the truth table go through the Boolean 
operation circuit shown in Figs. 7-9 to form the Bool- 
ean basis vector in the optical source arrays. 



For Fig. 6, we derive the following equations for the 
addition of two photodetectors. After properly adjust- 
ing the value of R P1 and R P2 , this circuit can success- 
fully add together the output of the two photodetec- 
tors: 



V + = 1 + 



R 



R 



where 



Nl 
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(12) 



P2 



The second part of Fig. 6 shows a comparator. The dif- 
ferential comparator is an amplification circuit that 
can multiply the difference between two inputs by the 
differential gain. The output voltage and the common 
mode rejection ratio (CMRR) of the comparator can be 
expressed as 
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/y + + y_\ 

V 0 =A d (V + - V.) + A CM \ I , (13) 



CMRR = 201og- -, (14) 

\Acm\ 

where V + and V_ are the two input voltages, V 0 is the 
output voltage, A d is the differential mode gain, and 
A CM is the common mode gain. The example shown 
here only demonstrates three inputs; however, the 
above example can be extended to a truth table with 
four or five input variables. In the case of a truth table 
with five input variables, we may replace the circuit in 
Fig. 8 with that of Fig. 9. A truth table with 10 inputs 
(ra = 10) is equivalent to a 1 K optoelectronic ROM. 

Figures 7-9 show the differential cascade voltage 
switching logic. Using positive feedback to shorten the 
switching time, we can estimate the switching time as 
follows [36]: 

L delay 

^OMRxCoxnN 2 , (15) 

where t deiay is the total time delay in cascade switch- 
ing logic, N is the number of metal-oxide semiconduc- 
tor field-effect transistor (MOSFET) gates, and 
Rn^oxn indicates the process parameters. This is 
about 2.1 for the 50 nm process. Therefore, for the cir- 
cuit shown in Fig. 8, the switching time delay is about 
6.6 ns. 

B. Architecture and Implementation 

In this section, we perform system parameters 
analysis on the 3-D optoelectronic free-space intercon- 
nection network. 

Optical experimental results for dual sources and 
dual detectors have already been demonstrated by 
many laboratories such as the University of Califor- 
nia, San Diego (UCSD) [37-39], Stanford University 
[40,41], Germany [42-45], the University of Arizona 
[46], and the U.S. Air Force [47,48]. 




V DD 




Fig. 8. Circuit to implement a three-variable Boolean basis vector 
[36]. 

For a traditional 4f optical system, the input plane 
is amplitude modulated, and the output detector 
plane is also amplitude (intensity) encoded; therefore, 
the phase information about the filter plane that ar- 
rives on the optical detector plane becomes a noise sig- 
nal [49]. In addition, the phase detector cannot detect 
phase simultaneously with the amplitude detector. 
Consequently, we need to use two separate channels 
[14,50] to encode the positive and the negative signals 
into a coherent optical system [14]. This is shown in 
Fig. 10. Refer to Fig. 11, where each source pixel on 
the optical input plane can be either made with a VC- 
SEL or a spatial light modulator. Note that the VC- 
SEL is normally made with GaAs, and the XOR gates 
are made with the complementary metal-oxide semi- 
conductor (CMOS) process. This means that two dif- 
ferent types of material must be used in the flip chip 
bonding process to combine the two. 

Referring to Fig. 12, the positive pair on the input 
plane is indicated with two positive signs and the 
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Fig. 7. Circuit to implement a two- variable Boolean basis vector 
[36]. 



Fig. 9. Circuit to implement a five-variable Boolean basis vector. 
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Fig. 10. Traditional free-space optoelectronic interconnection. 

negative pair with two negative signs. It is assumed 
that the combinatorial logic has m inputs and n out- 
puts and a total number of N of the connection logics 
running in parallel, so one column will need 2 m+1 pix- 
els in the source array. A total of N columns will need 
NX 2 m+1 pixels in the source array. 

According to the Rayleigh criterion [6], for any 
image-forming optoelectronic system, the minimum 
resolvable distance is related to the operating wave- 
length X, the focus length /*, and the lens diameter D 
by the following equation: 



2.44X/* 



D 



(16) 



According to Fig. 12, assuming the length of each 
pixel is W s , the distance between each pixel is d s , and 
A X or for all XOR gates in the source array, then the 
total area occupied by the source array is 

A s = [W s + (2 m+1 - l)d s ][W s + (2N- l)d s ] +A XOR N. 

(17) 

Apply the same principle to the detector plane as 
shown in Fig. 13. Note that the positive pair and the 
negative pair are pointing in opposite directions due 
to the spatial position exchange of the optical inner 
product. In addition, we need to include the space 
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Fig. 12. Optical input array plane, normally constructed with VC- 
SEL arrays with XOR gates and CMOS driver circuits [14]. 

^adders f° r the adders and A cmps for the comparators on 
the detector plane; therefore, we derive the following 
equations: 



d n = 



2.44X/* 



D 



(18) 



A D = [W D + (2n - l)d D JW D + (2N - l)d D ] + (A cmps 
+A adders )XnN. (19) 

The total occupied area of switching interconnect log- 
ics is 

A 3D =A S +A D . (20) 
The total volume is approximately equal to 



V 3D = -(A s+ A D )x4f. 



(21) 



To calculate the total power consumed by the system, 
assume each pixel of the VCSEL power is P s , and each 
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Fig. 11. Input plane — sources, polymer waveguide clock, and the 
latch. 



Fig. 13. Optical detector plane, normally constructed with photo- 
dector arrays and additional logic circuits [14]. 
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pixel of the detector is P D . The total consumed power 
is 

P 3D = 2 m+2 NP S + 71NX 4P d + P X0R + P cmps + P adders 

« (2 m+2 P s + 4nP d ) X N. (22) 

The total latency delay from the source plane to the 
next plane is close to 



hD -t s + tD + txOR + temps + * adders- 



(23) 



The traditional optoelectronic system can be shrunk 
into a much smaller size as shown in Fig. 14. This mi- 
croscale 3-D free-space interconnect consists of reflec- 
tive mirror layers, two Fourier transform holograms, 
and one connection matrix computer-generated holo- 
gram (CGH). The distance between the Fourier trans- 
form hologram and connection matrix CGH is d. This 
distance is the same as to the second Fourier trans- 
form hologram. The Fourier transform holographic 
lenses can be obtained through either optimized CGH 
[51] or recursive design techniques [52]. With the help 
of a computer-generated hologram [53], the connection 
matrix R in Eq. (10) is stored in the CGH. There are 
three types of computer-generated hologram encoding: 
Lohnmann's method, Lee's method, and Burch's 
method [53]. The hologram is recorded directly onto 
the film with a microdensitometer. More recently, an 
e-beam has been used to perform the same task with 
greater precision. 

In the above analysis of a 3-D free-space optoelec- 
tronic interconnect network, the switching logic sig- 
nals are generated electronically. This makes the en- 
coding bit pattern more accurate and may reduce the 
bit error propagation. 

C. Simulation Results 

As shown in Fig. 6, the receiver circuits for the pro- 
posed optical interconnection network are composed of 
two parts, one for the positive signal and one for the 
negative signal. The received light is first converted 
into electrical signals by the detector arrays and then 
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added together by the two adders. Next, the results 
obtained from the two adders are compared using a 
high-speed analog comparator to generate the truth- 
table output. The design of the adder and comparator 
will be discussed in greater detail below. Finally, we 
summarize the performance of these electronic cir- 
cuits based on the simulation results using a predic- 
tive 45 nm CMOS process. 

We use a simple but effective CMOS two-stage op- 
amp configuration to implement the adder. The struc- 
ture of this op-amp is shown in Fig. 15. This structure 
is appropriate for use in high-speed optical signal re- 
ceivers due to their low noise, stability, and high band- 
width. In this design, we use PMOS as input transis- 
tors to reduce the noise; we also adopt a nulling 
resistance compensation approach to satisfy the sta- 
bility requirement in the closed-loop application. Fur- 
thermore, the phase margin (PM) is made insensitive 
to process and environment variations by using M9, 
M10, and Mil transistors to implement the nulling 
resistor. 

The most important design specification of this op- 
amp is the gain-bandwidth product (GBW). Since the 
op-amp is used in the closed-loop adder and the band- 
width of present-day photodetectors is normally larger 
than 10 GHz, we set the GBW specifications to be 
larger than 10 GHz in this design. Small signal analy- 
sis provides the GBW expression shown below: 



GBW = g ml /C c , 



(24) 



where g ml is the transconductance of transistor Ml 
and C c is the Miller compensation capacitance. The 
phase margin of this op-amp is given by 



PM = 90 ' 



/ gml C L 

arctanl 

\SmS Cc 



+ arctan 



gml ( (W/L) 3 V(w7Z^ 

g m3 \(W/L) n J(W/L) lt 



-1 



(25) 



where g m3 is the transconductance of transistor M3, 
C L is the load capacitance, and (WIL) t is the size ratio 
of transistor M t . Following these two equations, we 
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Fig. 14. Microscale 3-D free-space holographic interconnection Fig. 15. Structure of the CMOS two-stage op-amp with nulling re- 
network, sistance compensation. 
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design the op-amp using the 45 nm process, with the 
assumptions that C L is 20 fF and the PM should be 
larger than 50°. 

The op-amp structure is simulated in Hspice and 
some typical results are shown below. The op-amp 
open-loop frequency response is shown in Fig. 16. We 
can see that the DC gain of this op-amp is about 42 
and the GBW is 10.2 GHz. To verify the insensitivity 
of the phase margin, we simulate the op-amp open- 
loop characteristics under three different tempera- 
tures. The results show that the PM is maintained in 
the range of 47°-50°. To verify the functionality and 
measure the performance data, we perform a closed- 
loop transient simulation like that shown in Fig. 17. 
The op-amp is connected with resistors as shown in 
Fig. 6 to form a small signal adder, and two 10 GHz 
pulse signals with different amplitudes (50 and 
100 mVpp) are added at the inputs. In the simulation, 
we measure the delay of this adder from the input 
peak value to the output peak value to be around 
90 ps; the power dissipation in this scenario is 
260 /xW. 

The high-speed comparator is implemented using 
the structure shown in Fig. 18. This structure cas- 
cades through several stages to achieve high-speed 
comparison. The first stage is a differential pair with 
diode-type loads designed to provide low gain but high 
bandwidth. In the second stage the positive feedback 
technique is adopted to amplify the signal difference 
with higher speed. The signal will be amplified again 
by a self-biased differential amplifier in the third 
stage and finally will become a full-swing digital sig- 
nal buffered by an inverter. Still, the bandwidth of the 
comparator should be larger than 10 GHz as dis- 
cussed for the op-amp. During designing, we need to 
tune the first stage to achieve a high bandwidth with 
lower but reasonable gain. 

The designed comparator circuit is also simulated 
with Hspice using the same 45 nm process model. Fig- 
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Fig. 16. Open-loop frequency response of the designed op- amp. 
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ure 19 shows the DC transfer characteristics of this 
comparator. From the figure, we can see that the com- 
parator has a DC voltage gain of about 90, correspond- 
ing to a 10 mV resolution voltage, and the offset volt- 
age is about 5 mV. A transient simulation is also 
performed to verify that the circuit is functional at a 
speed of 10 GHz. The delay of the comparator is 44 ps 
for a 50 mV input and 58 ps at 10 mV input. The cor- 
responding power dissipation is 470 and 520 julW, re- 
spectively. 

Finally, we put all the components together and per- 
form an overall simulation to demonstrate the func- 
tionality of the receiver system and also measure the 
performance data. The verification experiment is per- 
formed by adding four pulse inputs with different am- 
plitudes (25, 50, 75, and 100 mV) and different pseu- 
dorandom binary sequence (PRBS) patterns. The cycle 
time of the signals is 100 ps (10 GHz). The output of 
the system with this type of random input is shown in 
Fig. 20. We can see that the output waveform cor- 
rectly tracks the difference in the two adders' outputs. 
Although the delay and power dissipation of this sys- 
tem is data dependent, we measure some different 
points to find typical values. The delay is around 
50 ps, with 53 ps for the rise delay and 47 ps for the 
fall delay, whereas the power dissipation is about 
955 julW including the two op-amps and one compara- 




Fig. 18. Structure of the CMOS high-speed comparator with buff- 
ered output. 
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Fig. 19. DC transfer characteristics of the designed comparator. 



tor. We can also estimate the total chip area occupied 
by this electronic receiver. In our design, the two op- 
amps and one comparator cost 2x0.15 /mm 2 
+ 0.61 /xm 2 = 0.9 /mm 2 . 



iv. three-dimensional optoelectronic mems 
Interconnect Analysis 

In this section, we perform system parameter 
analysis of the optoelectronic MEMS interconnection 
system. 

invented the world's first MEMS, the deformable 
mirror device (DMD). Lin and Feldman [54] proposed 
using DMDs and CGHs to make an optical MEMS in- 
terconnect system. Their system diagram is illus- 
trated in Fig. 21. In the figure, arrays of microma- 
chined deformable mirrors are arranged on a silicon 
substrate. 

The optical source in this diagram can be the VC- 
SEL. The optical source sends a light beam vertically 
to the CGH. The CGH diffracts a total of six beams of 
light onto two DMDs and four optical detectors on the 
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Fig. 21. Optoelectronic MEMS interconnect diagram. 

substrate. The beams are again reflected from the 
DMDs back to the second spot B on the CGH, so that 
the CGH diffracts four light beams onto four detectors 
PD1, PD2, PD3, and PD4. If the diffracted light beam 
is on the same diffraction order, then there is no phase 
change; otherwise, there is a 180 deg phase change 
between the first and the second diffraction order 
among the reflected beams. 

Since the DMD is a phase modulator, let the phase 
of the optical source be <f> 0 . Then the phase at B is 
(f>o+(f>i, the phase at C is 0 O + ^-i- cf> 2 , and the phase at 
D is 0o + ^2? where and cf> 2 are the phase shift of the 
first and the second DMDs. Assume that DMD1 is 
turned off, then the photodetector PD1 receives a con- 
structive light beam, PD2 receives a destructive light 
beam, PD3 receives a constructive light beam, and 
PD4 receives a destructive light beam. Assuming that 
DMD1 is turned on, the DMD1 modulates the phase 
by 180 deg. In this case photodetector PD1 receives a 
destructive light beam, PD2 receives a constructive 
light beam, PD3 receives a destructive light beam, 
and PD4 receives a constructive light beam. From 
each point A, B, C, and D on the CGH, each input light 
beam is further divided into four beams, and each one 
will focus on the photodetectors PD1, PD2, PD3, and 
PD4. If we use +1 to represent an in-phase beam at 
the photodetector and -1 to represent the 180 deg out- 
of-phase beam at the photodetector, then we can con- 
struct a truth table of interconnection patterns as in 
Table I. If all four photodetectors are in phase with 
the optical source, this means that the connection is in 
the "on" state, otherwise, if only two photodetectors 
are in phase, then the other two photodetectors are 
180 deg out of phase with the optical source, and this 
means that the connection is in the "off state. 

The movable portion in the MEMS interconnection 
system uses the spring structure. The fixture beam 
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TABLE I 

Two Input Variables Showing the Truth-Table Interconnection Pattern for the DMD-CGH Optical MEMS Inter- 
connection System 



DMDl 



DMD2 



PD1 



PD2 



PD3 



PD4 



State 


State 


A 


B C 
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B C 
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A 


B C 


D 








0 
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+ 1 


+1 +1 


+1 


+1 


-1 -1 


+1 


+1 


+1 -1 


-1 


+1 


-1 +1 


-1 


OFF 


OFF 




ON 






OFF 






OFF 






OFF 
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+ 1 


-1 -1 


+1 


+1 


+1 +1 


+1 


+1 
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-1 


+ 1 
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OFF 
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-1 


+1 
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-1 


+1 
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+ 1 


+ 1 


-1 -1 


+ 1 


OFF 
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OFF 






OFF 






ON 






OFF 
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+ 1 


-1 +1 


-1 


+1 


+1 -1 


-1 


+1 


-1 -1 


+ 1 


+ 1 


+ 1 +1 


+ 1 


ON 


ON 




OFF 






OFF 






OFF 
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DMD is a MEMS device with four cantilever hinges at 
right angles. The fixture beam DMD primarily per- 
forms phase modulation of light. The three- 
dimensional DMD structure is shown in Fig. 22. The 
DMD device has flexure hinges at each end. One end 
attaches to the support structure, and the other end is 
attached to the phase plate. This principle is the same 
as the electric current reflector; both fixed sided 
beams are distorted and rotated along the direction of 
the hinges. Since MEMS can produce periodical me- 
chanical oscillations, the periodical oscillation fre- 
quency is called the resonant frequency. The resonant 
frequency of the DMD with the flexure hinges on each 
end can be expressed as 



fo- 



D 2 h 
2^L 2 



E 

Yip 



(26) 



Therefore, the response time of the DMD can be ap- 
proximated by the inversion of Eq. (26): 



''DUD - 



2irL 2 



Yip 
~E~' 



(27) 



where D is a constant, normally, D = 0.48 at the lowest 
resonant frequency; E is the elasticity coefficient; L 
and h are the hinge's length and thickness, respec- 




tively; and p is the material's density. According to Lin 
[54,55], actual measurement of the DMD's response 
time indicates a rise time of 3.5 /uls and a fall time of 
2.5 /uls at 180 deg of phase modulation. 

In general, if we set up the same interconnection I 
columns on the silicon substrate, for a two input vari- 
able truth-table optical MEMS interconnection, the 
total required power can be expressed as 

P = (P s + 2P DMD + 4P D )l. (28) 

The optical MEMS interconnection occupies an area of 

W base =A S + 4A D + 2A DMD + 6d , (29) 



*>MEM 



~ ^ ' basely 



(30) 



where d is the space between the detector and the 
DMDs or the source and the detector. The total la- 
tency delay of the optoelectronic MEMS interconnect 
is 



t = t s + t D + 2t 



DMD- 



(31) 



In our analysis, note that the truth-table outputs of 
optical MEMS interconnects are generated optically. 
Also, the above example only contains two inputs. 
When the number of input lines increase to three or 
four, more reflective surfaces and DMDs are involved, 
which increases the possibility of bit error propaga- 
tion in the optical data path. 



V. Latency, Area, and Volume Comparison 



In summary, we arrive at the following conclusions 
in this section. Previously, Li [5,6] did important work 
analyzing fiber-optical interconnects. There has also 
been much work done on Fresnel holographic inter- 
connects. 



Fig. 22. DMD single-cell diagram. 



Assume that there are N c interconnect points, then 
the area occupied by the fiber-optical interconnect is 
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A 0F = N C (A S + (32) 

where A s is the area occupied by a source, A D is the 
area occupied by a detector, and A dectronic is the area of 
all electronic components. Therefore, the total latency 
delay is 

t 0 F =ts + tD + t electronic • (33) 

The physical principle connecting the different VLSI 
electronics by the Fresnel hologram is very similar to 
the fiber-optical interconnect. Therefore, we have the 
two following equations: 

^Fresnel = N, c(A S + A D ) + A dectronic , (34) 
tfresnel-ts + tD + t electronic (35) 

We summarize the comparison results for the area 
and latency of different optoelectronic interconnects in 
Table II. 

Consider an example of a calculation comparing the 
three types of interconnects. We choose to use 
Finisar's VCSEL [56] as the single pixel's source. Its 
rise time and fall time are each 40 ps. The area occu- 
pied per source pixel is 0.250 mm 2 . We use Alphalas' 
ultrafast photodetector [57] as the detector. Its re- 
sponse time ranges from 30 to 500 ps. We choose to 
use 50 ps as our calculation base and 0.250 mm 2 for 
the single detector's area. Assume the use of only four 
inputs for interconnection. Then the occupied area for 
both fiber-optical and Fresnel hologram interconnects 
is about 2.0 mm 2 plus the electronic component's size. 
At present, a 45 nm CMOS process for a complex bus 
switching logic should be less than 1.0 mm 2 , so the to- 
tal area is about 3.0 mm 2 . For both fiber-optical and 
Fresnel hologram interconnects, the total latency will 

TABLE II 

Comparison of Area and Latency of Different Opto- 
electronic Interconnects 



Optoelectronic 



Interconnects 


Area 


Latency 


Fiber-optical 


A 0F =N C (A S +A D ) 


t()F = ts + t D + 1 electronic 


interconnect 


A electronic 




Fresnel 


A fresnel =N c (A s +A D ) 


tfresnel = ts + t D 


hologram 


A electronic 


t electronic 


interconnect 






Free-space 


V 3D =l(A s+ A D )x4f 


tsD = t S + tD + 


optoelectronic 




txOR + temps + 1 adders 


interconnect 


A=[WM^ +1 -Dds\ 




network 


X[W s + (2N-l)d s ] 






+A X0R N 






A D =[W D + (2n-l)d D ] 






X[W D + (2N-l)d D ] 






+(A cmps +A adders )XnN 




Optical MEMS 


W base =A s + 4A D 


t = t s + t D + 2t DMD 


interconnect 


+2A DMD +6d 






AMEM=W oase l 





be 90 ps plus the electronic component's delay t e i ectronic . 
According to the International Technology Roadmap 
for Semiconductors (ITRS) [58], t e i ectronic is about 
0.7 ns (700 ps), t eiectronic may reduce to 0.2 ns (200 ps) 
by the year 2022. If we use 2 mW per VCSEL [56] and 
1 mW per detector [57] for calculation, the total con- 
sumed power is approximately equal to 13 mW. How- 
ever, for a 3-D free-space optoelectronic interconnect 
network, the total occupied area (one column only) is 
approximately equal to 10 mm 2 + 6 mm 2 = 16 mm 2 , the 
total latency will be 140 ps + 6 ns (XOR gates), and the 
total consumed power will be approximately equal to 
147.8 mW. However, if we use the optical MEMS in- 
terconnects, the latency will be in the 12 juls range. 
This is very slow compared with other types of inter- 
connects. Note that this data is only a preliminary cal- 
culation, and the technology parameters vary from 
time to time. 

VI. Power Consumption Comparison 

To compute the power consumption, assume that 
there are N c interconnect points. The power required 
by the fiber-optical interconnect is then 

P 0 F = N C (Ps + Pd) + P electronic > (36) 

where P s is the power required by a source, P D is the 
power required by a detector, and P e iectronic is the 
power consumed by all electronic components. 

The physical principle to connect the different VL- 
SIs using the Fresnel hologram interconnect is very 
similar to the fiber optical interconnect method. 
Therefore, we have the following equation: 

P Fresnel = N c(Ps + Pd) + P electronics • (37) 

We summarize the comparison of the power consump- 
tions of different optoelectronic interconnects in Table 
III. Symbols used are shown in Table IV. 

VII. Discussion and Conclusion 

Silicon photonic interconnects have received a lot of 
attention recently [59-65]. Miller [59] made a signifi- 
cant contribution to compare electrical and optical in- 
terconnects and examine the requirements for opto- 
electronic and optical devices for future silicon chips. 
Moreover, silicon photonic interconnects have been 
demonstrated successfully in intrachip clock distribu- 
tion [62]. However, we have not discovered a revolu- 
tionary approach such that an entire VLSI chip can be 
replaced by optoelectronics [65] . New research may be 
required to study power loss and the optical bit error 
rate in silicon photonic interconnects. 

In this paper, we derive and prove the algorithm to 
be used in a 3-D free-space optoelectronic interconnec- 



218 J. OPT. COMMUN. NETW./VOL. 2, NO. 4/APRIL 2010 



Y.-P. Lee and Y. Zhang 



TABLE III 

Comparison of Power Consumptions of Different Opto- 
electronic Interconnects 



Optoelectronic 
Interconnects 


Power Consumption 


Pi l">PT*-onti pal 


l Qp — iM(j\r g-r l d) t l electronic 


interconnect 




Fresnel 


P Fresnel = N. c(Ps + Pd) + P electronic 


hologram 




interconnect 




Free-space 


P 3D = 2 m+2 xNP s +nNx4P d+ P XOR+ P cmps+ P adders 


optoelectronic 


~(2 m+2 XP s + 4nP d )xN 


interconnect 




network 




Optical MEMS 


P = (P s + 2P DMD + 4P D )l 


interconnect 





tion network. We introduce sophisticated mathemati- 
cal equations for different types of optoelectronic in- 
terconnects. Among the four types of optoelectronic 
interconnects, according to our preliminary analysis, 
both the fiber-optical interconnects and the 3-D free- 
space optical interconnect network have the potential 
for future short-distance interconnection. Since the 
bus switching logic's signals are generated electroni- 
cally, this may reduce the bit error propagation. In ad- 
dition, the bus switching logics are inherently embed- 
ded inside the hologram. This can further reduce the 
size and consumed power, while at the same time in- 



TABLE IV 

System Symbols Used in the Calculations 



Nomenclature 


Explanation 




Total latency delay in 3-D optoelectronic 




interconnection network 


t 0F 


Total latency delay in fiber-optical 




interconnections 


tfresnel 


Total latency delay in Fresnel hologram 




interconnect 


t s 


Response time per source pixel (VCSEL or 




modulator) 


t d 


Response time per detector 


tDMD 


Response time per DMD device 


\ 


Optical wavelength 


D 


Lens diameter 


N c 


Number of connection points for point-to-point 




interconnects 


Ps 


Power consumed per source pixel 


Pd 


Power consumed per detector 


Pdmd 


Power consumed per DMD pixel 


As 


Area occupied per source pixel 


Ad 


Area occupied per detector 


Admd 


Area occupied per DMD pixel 


■^■electronics 


Total area occupied by electronics 


p 

1 electronics 


Total power consumed by electronics 


m 


Number of input columns in truth table 


n 


Number of output columns in truth table 



crease the computational bandwidth. However, due to 
the bipolar encoding scheme, this technology has a 
larger power consumption and occupies a larger area 
than the traditional electronic interconnect. 
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